OCR Exploration Server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility

Internal identifier: 001575 (Main/Exploration); previous: 001574; next: 001576

Authors: Masakazu Suzuki (mathematician) [Japan]; Toshihiro Kanahori [Japan]; Nobuyuki Ohtake [Japan]; Katsuhito Yamaguchi [Japan]

RBID : ISTEX:C3DA26204F8DF9D7EE0B23B4638C57BFACB628A7

Abstract

This paper briefly describes a practical integrated system, named ‘Infty’, for scientific documents that include mathematical formulae. The system consists of three components: an OCR system named ‘InftyReader’, an editor named ‘InftyEditor’, and tools for converting to various formats. These applications are linked to each other via XML files. InftyReader recognizes scanned images of clearly printed mathematical documents and outputs the recognition results in an XML format. It recognizes the complex mathematical formulae, including matrices, used in research papers in mathematics. InftyEditor provides an efficient keyboard interface for correcting the recognition results. Other features of InftyEditor are a handwriting interface for inputting mathematical formulae, intended for sighted users, and a speech interface for visually impaired users. The XML files output by InftyReader and InftyEditor can be converted into various formats: LaTeX, MathML, HTML and Braille codes, namely UBC (Unified Braille Codes) for English texts and Japanese Braille Codes for Japanese texts.

DOI: 10.1007/978-3-540-27817-7_97


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility</title>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
</author>
<author>
<name sortKey="Ohtake, Nobuyuki" sort="Ohtake, Nobuyuki" uniqKey="Ohtake N" first="Nobuyuki" last="Ohtake">Nobuyuki Ohtake</name>
</author>
<author>
<name sortKey="Yamaguchi, Katsuhito" sort="Yamaguchi, Katsuhito" uniqKey="Yamaguchi K" first="Katsuhito" last="Yamaguchi">Katsuhito Yamaguchi</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:C3DA26204F8DF9D7EE0B23B4638C57BFACB628A7</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-27817-7_97</idno>
<idno type="url">https://api.istex.fr/document/C3DA26204F8DF9D7EE0B23B4638C57BFACB628A7/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000267</idno>
<idno type="wicri:Area/Istex/Curation">000262</idno>
<idno type="wicri:Area/Istex/Checkpoint">000E10</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Suzuki M:an:integrated:ocr</idno>
<idno type="wicri:Area/Main/Merge">001626</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:04-0407119</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000536</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000254</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000504</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Suzuki M:an:integrated:ocr</idno>
<idno type="wicri:Area/Main/Merge">001715</idno>
<idno type="wicri:Area/Main/Curation">001575</idno>
<idno type="wicri:Area/Main/Exploration">001575</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility</title>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Hakozaki 6-10-1, Higashiku, 812-8581, Fukuoka</wicri:regionArea>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Research Center on Educational Media, Tsukuba College of Technology, 4-12 Kasuga, Tsukuba-shi, 305-0821, Ibaraki</wicri:regionArea>
<wicri:noRegion>Ibaraki</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
<author>
<name sortKey="Ohtake, Nobuyuki" sort="Ohtake, Nobuyuki" uniqKey="Ohtake N" first="Nobuyuki" last="Ohtake">Nobuyuki Ohtake</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Research Center on Educational Media, Tsukuba College of Technology, 4-12 Kasuga, Tsukuba-shi, 305-0821, Ibaraki</wicri:regionArea>
<wicri:noRegion>Ibaraki</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
<author>
<name sortKey="Yamaguchi, Katsuhito" sort="Yamaguchi, Katsuhito" uniqKey="Yamaguchi K" first="Katsuhito" last="Yamaguchi">Katsuhito Yamaguchi</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Junior College Funabashi Campus, Nihon University, 7-24-1, Narashinodai, Funabashi, 274-8501, Chiba</wicri:regionArea>
<wicri:noRegion>Chiba</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2004</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">C3DA26204F8DF9D7EE0B23B4638C57BFACB628A7</idno>
<idno type="DOI">10.1007/978-3-540-27817-7_97</idno>
<idno type="ChapterID">97</idno>
<idno type="ChapterID">Chap97</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Accessibility</term>
<term>Audio acoustics</term>
<term>Braille writing</term>
<term>Character recognition</term>
<term>English</term>
<term>HTML language</term>
<term>Japanese</term>
<term>Keyboard</term>
<term>Latex</term>
<term>Manuscript character</term>
<term>Optical character recognition</term>
<term>Printed character</term>
<term>Printed document</term>
<term>Speech processing</term>
<term>Text</term>
<term>User assistance</term>
<term>User interface</term>
<term>Vision disorder</term>
<term>XML language</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Accessibilité</term>
<term>Acoustique audio</term>
<term>Anglais</term>
<term>Assistance utilisateur</term>
<term>Caractère imprimé</term>
<term>Caractère manuscrit</term>
<term>Clavier</term>
<term>Document imprimé</term>
<term>Ecriture Braille</term>
<term>Interface utilisateur</term>
<term>Japonais</term>
<term>Langage HTML</term>
<term>Langage XML</term>
<term>Latex</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Texte</term>
<term>Traitement parole</term>
<term>Trouble vision</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper briefly describes a practical integrated system, named ‘Infty’, for scientific documents that include mathematical formulae. The system consists of three components: an OCR system named ‘InftyReader’, an editor named ‘InftyEditor’, and tools for converting to various formats. These applications are linked to each other via XML files. InftyReader recognizes scanned images of clearly printed mathematical documents and outputs the recognition results in an XML format. It recognizes the complex mathematical formulae, including matrices, used in research papers in mathematics. InftyEditor provides an efficient keyboard interface for correcting the recognition results. Other features of InftyEditor are a handwriting interface for inputting mathematical formulae, intended for sighted users, and a speech interface for visually impaired users. The XML files output by InftyReader and InftyEditor can be converted into various formats: LaTeX, MathML, HTML and Braille codes, namely UBC (Unified Braille Codes) for English texts and Japanese Braille Codes for Japanese texts.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
<region>
<li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement>
<li>Fukuoka</li>
</settlement>
<orgName>
<li>Université de Kyūshū</li>
</orgName>
</list>
<tree>
<country name="Japon">
<region name="Kyūshū">
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</region>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<name sortKey="Ohtake, Nobuyuki" sort="Ohtake, Nobuyuki" uniqKey="Ohtake N" first="Nobuyuki" last="Ohtake">Nobuyuki Ohtake</name>
<name sortKey="Ohtake, Nobuyuki" sort="Ohtake, Nobuyuki" uniqKey="Ohtake N" first="Nobuyuki" last="Ohtake">Nobuyuki Ohtake</name>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<name sortKey="Yamaguchi, Katsuhito" sort="Yamaguchi, Katsuhito" uniqKey="Yamaguchi K" first="Katsuhito" last="Yamaguchi">Katsuhito Yamaguchi</name>
<name sortKey="Yamaguchi, Katsuhito" sort="Yamaguchi, Katsuhito" uniqKey="Yamaguchi K" first="Katsuhito" last="Yamaguchi">Katsuhito Yamaguchi</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001575 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001575 | SxmlIndent | more
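
Where Dilib is not installed, a saved copy of the record can also be read with a general-purpose XML parser. The following is only a minimal sketch in Python, not part of Dilib: the names `parse_record` and `summarize` and the namespace URI `urn:example:wicri` are assumptions made for illustration. The record uses the `wicri:` prefix without declaring it, so the sketch wraps the text in an element that binds the prefix to a placeholder URI before parsing. The sample is a shortened excerpt of the record.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespace URI: the record uses the "wicri:" prefix without
# declaring it, so a placeholder binding is supplied to make it parseable.
WICRI_NS = "urn:example:wicri"

# Shortened excerpt of the record (attributes trimmed for brevity).
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility</title>
<author>
<name sortKey="Suzuki, Masakazu" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</author>
<author>
<name sortKey="Kanahori, Toshihiro" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="doi">10.1007/978-3-540-27817-7_97</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""


def parse_record(xml_text):
    """Parse a record, binding the undeclared wicri: prefix on a wrapper element."""
    wrapped = '<wrapper xmlns:wicri="%s">%s</wrapper>' % (WICRI_NS, xml_text)
    return ET.fromstring(wrapped).find("record")


def summarize(record):
    """Return the title, author sort keys and DOI found in the TEI header."""
    title_stmt = record.find(".//titleStmt")
    return {
        "title": title_stmt.findtext("title"),
        "authors": [n.get("sortKey") for n in title_stmt.findall("author/name")],
        "doi": record.findtext(".//publicationStmt/idno[@type='doi']"),
    }


if __name__ == "__main__":
    print(summarize(parse_record(SAMPLE)))
```

The wrapper element is used so that the record text itself is left unchanged. If the real namespace URI for `wicri:` is known, it should replace the placeholder.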

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:C3DA26204F8DF9D7EE0B23B4638C57BFACB628A7
   |texte=   An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024